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ABSTRACT 


-s. 


/ 

The  quality  of  a  product  night  be  determined  by  several  parameters,  each 
of  Which  must  meet  certain  standards  before  the  product  is  acceptable.  In 
this  paper,  a  method  of  determining  whether  all  the  parameters  meet  their  re¬ 
spective  standards  is  proposed.  The  method  consists  of  testing  each  parameter 
individually  and  deciding  that  the  product  is  acceptable  only  if  each  parameter 
passes  its  test.  This  simple  method  has  some  optimal  properties  including  at¬ 
taining  exactly  a  prespecified  consumer's  risk  and  uniformly  minimising  the 
producer's  risk.  These  results  are  obtained  from  more  general  hypothesis  testing 
results  concerning  null  hypotheses  consisting  of  the  unions  of  sets. 
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1.  INTRODUCTION 


In  many  situations,  the  quality  of  a  product  is  determined  by  several 
parameters.  The  product  is  of  acceptable  quality  to  the  consumer  only  if  each 
of  the  parameters  meets  certain  standards.  For  example,  an  upholstery  fabric  must 
meet  standards  for  strength,  colorfastness,  and  fire  resistance.  Based  on  some 
measurements  on  the  product,  the  consumer  must  decide  whether  the  product  is 
acceptable,  i.e.,  all  of  the  parameters  meet  the  standards,  or  unacceptable,  l.e., 
one  or  more  of  the  parameters  do  not  meet  the  standards.  In  making  this  decision 
the  consumer  wishes  to  use  a  rule  which  controls  the  consumer’s  risk  at  a  small 
level . 

If  there  is  only  one  parameter  and  only  one  kind  of  measurement,  then  a  stan¬ 
dard  quality  controls  text  such  as  Durr  (1975)  or  Duncan  (1974)  gives  methods  for 
making  this  decision.  Different  methods  are  given  depending  on  whether  the  para¬ 
meter  is  a  mean,  variance  or  proportion  of  defectives  and  on  whether  the  measurements 
are  counts  of  defective  units  (sampling  by  attributes)  or  measurements  on  a  continuous 
variable  (sampling  by  measurements) .  But  no  text  that  the  author  has  found  deals 
with  the  situation  in  which  there  are  multiple  parameters  of  interest. 

This  problem  will  be  formulated  as  a  hypothisis  testing  problem  in  which  the 
null  hypotheses  states  that  one  or  more  of  the  parameters  do  not  meet  their  stan¬ 
dards  and  the  alternative  hypothesis  states  that  all  of  the  parameters  do  meet 

their  standards.  Then  the  probability  of  a  Type  I  error  will  be  the  consumer's 

* 

risk*  Thus  an  o-level  test  will  be  one  which  controls  the  consuaer9s  risk  at 
less  than  or  equal  to  a. 

The  test  proposed  herein  is  so  simple  it  must  not  he  nev.  "ut  t'*o  author  ’’as 
not  been  able  to  find  it  described  in  hypotheses  testing  or  quality  control  literature. 
The  test  is  the  following.  A  hypothesis  test  is  done  on  each  parameter  individually 
at  level  a.  The  overall  test  rejects  the  null  hypothesis  and  decides  that  all  of 
the  parameters  meet  their  standards  If  and  only  if  each  individual  test  decides 
that  the  individual  parameter. meets  its  standard. 
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This  test  has  several  interesting  properties.  First,  the  individual  tests 
are  performed  at  level  o  and  yet  the  overall  test  has  level  a.  Usually  When  doing 
simultaneous  inference  about  many  parameters  (see,  e.g.,  Hiller  (1966))  inferences 
about  individual  parameters  must  be  done  with  an  error  rate  of  less  than  a  to 
achieve  an  overall  error  rate  of  a.  This,  for  example  is  the  basis  of  the 
Bonferroni  method  of  simultaneous  confidence  intervals.  Second,  under  very 
mild  conditions,  the  level  of  this  test  is  exactly  ct.  So  the  test  is  not  being 
too  conservative  by  requiring  each  of  the  individual  tests  to  decide  that  the 
individual  parameter  meets  its  standard.  Third,  under  more  restrictive  conditions 
a  result  of  Lehmann  (1952)  can  be  used  to  prove  that  this  test  is  uniformly  most 
powerful  in  a  reasonable  class  of  tests.  In  terms  of  risks  this  says  that  this 
test  uniformly  minimizes  the  producer's  risk.  These  properties  indicate  that 
not  only  is  the  test  extremely  easy  to  implement,  since  it  deals  with  only  one 
parameter  at  a  time,  but  it  also  seems  to  be  a  reasonably  good  test. 

2.  Basic  Results 

Let  X  *  (Xj,  •••,  Xn)  be  a  random  vector  of  observations  whose  distribution 

is  determined  by  a  vector  parameter  0  *  (6^,»»»,  8^) .  Let  0  denote  the  parameter 

K 

space.  Let  0.,  i  *  1,  •’>•,  K,  be  subsets  of  0.  Let  0n  *  u  0..  Let  K'  denote 
1  i-1  1 

K 

the  complement  of  the  set  A.  Note  that  Q'  •  n  QT .  The  problem  to  be  considered 

0  i*l  1 

is  that  of  testing  H^:  8  e  0Q  vs  Hj:  0  e  0'.  In  the  example  in  the  introduction, 
0^  is  the  hypothesis  that  6^  does  not  meet  its  standard. 


If  0^  must  be  greater  than  in  order  to  meet  its  standard  then  c'^*{0:0JSc1>. 

If  ©^  oust  be  between  and  d^  to  meet  its  standard,  then  ©j«{0:0^£d^  or  O^Sc^}. 

With  this  formulation,  Hq  is  the  hypothesis  that  at  least  one  parameter  does  not  meet 
its  standard  and  is  the  hypothesis  that  every  parameter  meets  its  standard.  Kote 
that  K,  the  number  of  subsets,  may  be  less  than  £,  the  number  of  parameters.  This 
will  be  the  case  if  some  of  the  parameters  are  nuisance  parameters  and  do  uot  have 
standards  associated  with  then. 

Let  a,  OSafil  be  fixed.  For  i*l, •  •  •  ,K,let  i>f(x)  be  an  a-level  test  of  H^tOe©^ 
vs  £:®c©‘’^*i»e. ,  Egi|/^(X)<a  for  all  Let  Y  be  the  test  of  vs  !’  which  re¬ 
jects  if  and  only  if  every  ^  rejects  !!  . 

Other  authors  such  as  Eirnfcaum  (1954).  Birnhaun,  (1955),  Lehmann  (1955)  and 
Spjotvoll  (1972)  have  consider  testing  hypotheses  and  H^.  But  in  all  these 
papers,  except  the  one  result  of  Lehmann  to  be  discussed  in  Section  3,  the  null 
hypothesis  is  of  the  form  E  . 

Tsutakawa  and  llewett  (197C)  propose  the  test  ¥  for  a  problen  comparing  regression 
lines.  Theorems  1  and  2,  which  follow,  are  board  generalizations  of  a  result  they 
prove  about  a  test  based  on  a  bivariate  t  distribution. 

Wilkinson  (1951)  has  proposed  a  test  like  Y  but  in  a  very  different  situation. 
Wilkinson  assumes  that  the  individual  tests  are  a-level  tests  for  all  of  F^,  not  just 
'*01  *  Wilkinson  also  assumes  the  individual  tests  are  independent  which  typically  will 
not  be  the  case  in  the  problems  considered  herein. 

The  facts  that  Y  is  always  an  a-level  test  and  under  mild  conditions  has  size 
exactly  equal  to  a  are  presented  in  the  following  two  theorems. 

Theorem  1.  Y  is  an  a-level  test  of  vs  V  ,  i.e.,  EaY(X)Sa  for  all  0e©ft* 

1  !  -  v.  -  0 

Proof:  Let  Rj  be  the  event  that  ^  rejects  Then  "  -^^R^  is  the  «v««t  that 

Y  rejects  Rq.  Fix  6  e  ©q.  Then  0  e  ©^  for  some  i  and 

Ee  Y  (X)  -  p  0  (P.)  i  p  0  (P^)  -  E0  *t(X)  i  a  since  8  e  Qi  and  ^  is  an  a-level  test  of 


(4) 


Oi* 


I! 


Theorem  2:  Suppose  0^  -  {e-O^Sc^},  i-1,***,  K.  Suppose  the  power  of 
only  on  0i  and  0  Suppose  (X)*a  if  G^-c^-  Suppose  there 
bounds  (possibly  infinite)  such  that  for  any  fixed  values  of  0  +^,***, 
lim  E  ii>  (X)-l.  Then  ¥  has  size  exactly  a,  i.e.,  sun  E  ¥(X)  *  a« 

n  .  i.  9  i  ~  Q,«  0 


^  depends 
are  upper 


Proof :  Let  and  R  be  defined  as  in  the  proof  of  Theorem  1. 

Let  6^  *  (0^,***,  »  1*1»2,»»*  be  a  sequence  of  parameter  points  satisfying 

0li“Cl  for  a11  i’  6li  ^bj»i=2»  ***♦  “»  ~nd  °(K+l)i*  ***»  0£i  arc  fixed  for  all  i- 

Then  04  e  0,  for  all  i  and  PD  (R')  -  1-E„  (X)  -  1-a  for  all  i.  Also,  for 

-I  1  o  1  9^  1 

j«2,  K,  lim  P.  (RT)  -  1-  lim  P_  (R.)  -  1-1-0. 

i-*»  "i  i  i^o  1 

Therefore, 

sup  Efi  ¥(X)  S  lin  Efl  ¥(X) 

0c0n  !  i-*«  -i 


lir 


-i  j=i 


lin  (1  -  P0  (  u  Rp) 


x-*« 


-i  j*l 


2  1  -  lin  S  P  e  <RP 
i~  J-l  2i  j 

K 

-  1  -  lim  ((1-a)  +  £  P  (R-)) 

i-*-  j-2  ®i  J 

-  1  -  (1-a)  -  0  -  a. 


From  Theorem  1,  sup  E  ¥(X)  £  a. 

!*eo  •  • 


Thus  the  size  of  ¥  le  exactly  a.  | | 

The  proof  of  Theorem  2  shows  that  for  the  test  ¥  the  maximum  of  the  consumer's 

risk,  the  maximum  of  the  probability  of  a  Type  I  error,  occurs  when  one  parameter 

just  fails  the  standard,  G  -  c, ,  and  all  the  other  parameters  are  very  good  (large). 

XI 

This  is  exactly  whrt  would  be  expected.  If  many  of  the  parameters  do  not  meet  their 
standards,  it  should  be  easy  to  decide  that  the  product  is  unacceptable. 
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(5) 

But  if  all  the  parameters  but  one  are  very  good  and  the  one  exception  nearly 
meets  the  standard,  it  will  be  most  difficult  to  decide  that  the  product  is 
unacceptable. 

The  conditions  of  Theorem  2  will  hold  if  0^  is  a  normal  mean  and  iJk  is  a  one 
tailed  t-test  or  if  0^  is  the  proportion  of  non-defective  items  and  is  a  one 
tailed  binomial  test.  An  important  case  in  which  the  conditions  of  Theorem  2 
will  not  hold  is  the  case  in  which  0^  is  a  normal  mean,  the  standard  is 

c^  <  0^  <  d^  and  c^  and  d^  are  finite  numbers.  No  a-level  test  will  have  a  power 

approaching  one  on  this  set  of  alternatives. 

3.  An  optimality  Result. 

Lehmann  (1952)  considered  multiparameter  hypothesis  tests.  He  was  primarily 
concerned  with  testing  the  null  hypothesis  Hj  vs  the  alternative  HQ.  But  he  proved 
one  result,  Theorem  4.2,  for  the  situation  in  which  is  the  null  hypothesis. 
Although  Lehmann  did  not  speak  in  terms  of  combining  individual  tests  to  get  an 
overall  test,  his  Theorem  4.2  says  that  under  certain  conditions  the  test  Y  we 

have  proposed  is  uniformly  most  powerful  in  a  certain  class.  The  remainder  of 

this  section  is  a  review  of  Lehmann's  result  in  terms  of  our  notation. 

A  subset  of  A  of  is  called  monotone  if  x  e  A  and  y^  2:  x^,  i  =  1,  K, 

implies  y  e  A.  Suppose  K  =  L.  The  parameter  space  is  the  finite  or  infinite 
open  rectangle  a.  <  0.  <  b. ,  i  =  1,  •  • ' ,  K.  Let  Y1,<">  •  Y.  denote  K  statistics. 

Suppose  Pg(y)  is  the  joint  density  of  the  Y's  and  the  positive  sample  space  of 
the  Y's  is  the  open  rectangle  u^  <  y^  <  v^  independent  of  the  6's.  Suppose  the 
marginal  distribution  of  Y^  depends  only  on  0^  and  Y^  converges  in  probability  to 
v^  as  0^  **•  b^.  Suppose  the  joint  distribution  of  the  Y's  has  the  property  that 
if  §  and  0'  are  two  parameter  points  with  0^  s  0T,  i  *  1,  ■’•’K,  then 


(6) 


i 


P0(Y  e  A)  s  P0„(Y  e  A)  for  every  monotone  set  A.  Let  =  {e  :  6.  s  c.>. 

Let  iJk  be  the  test  which  rejects  HQi  if  Y^^  >  w.^  where  Wj  is  chosen  so  that 

Pg ( Y i  >  w  )  =  a  if  6.  =  c..  Under  these  assumptions  the  following  result 
car  be  proven. 

Theorem  3.  (Lehmann’s  Theorem  4.2). 

Under  the  above  assumptions,  the  test  V  which  rejects  HQ  :  6  e  0Q  if  and  only 
if  each  iJk  rejects  :  9  £  0.  is  uniformly  most  powerful  among  all  non- 
randomized  tests  which  have  monotone  rejection  regions. 

Although  Lehmann  did  not  discuss  nuisance  parameters,  the  results  of 
Theorem  3  will  continue  to  hold  if  the  assumptions  hold  for  each  fixed  set 


of  values  for  0, 


'K+l* 

Lehmann  gives  some  techniques  for  verifying  the  monotonicity  property 
assumed  about  the  joint  distribution  of  Yj ,  •«»,  Y^.  It  will  hold,  for 
example,  if  the  Y's  are  independent  t-statistics  and  the  0^'s  are  normal  means. 


4  AN  EXAMPLE 

An  example  of  specifications  given  in  terms  of  many  parameters  may  be 
found  in  the  textile  industry.  Table  1  lists  specifications  for  upholstery 
fabric  from  the  American  Society  for  Testing  and  Materials.  The  specifications 
give  standards  for  nine  parameters  related  to  strength,  dimensional  stability, 
colorfastness  and  flammability. 

The  first  four  parameters  might  be  assumed  to  be  normal  means;  mean  breaking 
strength,  mean  tear  strength,  etc.  The  first  three  standards  say  the  mean  must 
be  greater  than  some  value.  The  dimensional  change  standard  gives  an  upper  and 
lower  bound  for  the  mean.  These  four  hypotheses  night  be  tested  using  t-tests. 

An  upper  bound  on  the  variance  of  the  dimensional  change  measurements  will  have 
to  be  assumed  in  order  to  construct  an  a-level  test  based  on  a  t-statistics  because 
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of  the  hypothesis'  two  sided  form. 

The  last  five  parameters  might  be  measured  by  binomial  variables,  each 
variable  counting  the  number  of  units  in  a  sample  which  pass  the  corresponding 
test.  Each  parameter  would  then  be  the  proportion  of  units  in  the  population 
(a  particular  manufacturer's  output)  which  achieve  one  of  the  standards.  The 
usual  binomial  test  could  be  used  to  test  each  of  these  five,  one-sided  hypotheses. 

It  would  be  very  difficult  to  posit  a  realistic  multivariate  model  for 
these  nine  variables.  Some  are  discrete  and  some  are  continuous.  Some  are 
likely  to  be  correlated.  Yet  it  is  relatively  easy  to  construct  an  a-level  test 
for  each  parameter  individually. 

These  tests  can  be  combined  into  the  overall  Y  to  test  the  hypothesis  HQ  with 
a  consumer's  risk  of  a.  Theorems  2  and  3  indicate  that,  in  certain  situations, 
the  resulting  acceptance  plan  is  fairly  efficient  in  terms  of  both  producer's 
and  consumer's  risks. 

5.  CONCLUSIONS 

Acceptance  sampling  procedures  for  individual  parameters  are  well  known.  This 
paper  proposes  a  way  of  combining  these  procedures  in  the  situation  in  which  the 
quality  of  a  product  is  measured  by  st  jidards  on  several  parameters.  Not  only  is 
the  method  easy  to  implement  but  it  controls  the  consumer's  risk  at  exactly  a  pre¬ 
assigned  level  in  typical  situations  when  the  standards  are  one-sided  (either  upper 
or  lower  bounds) .  Under  slightly  more  restrictive  conditions  this  method  also 
uniformly  minimizes  the  producer's  risk.  This  method  can  be  used  in  hypothesis 
testing  problems  other  than  acceptance  sampling  if  the  null  hypothesis  is  a  union 
of  sets. 
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ta”Lt:  i 

Standard  Specification  for  Woven  Upholstery  Fabric 
_ Plain,  Tufted  or  Flocked _ 


Test 

Minimum  Standard 

Breaking  Strength 

50  pounds 

Tongue  Tear  Strength 

6  pounds 

Surface  Abrasion  (heavy  duty) 

15,000  cycles 

Dimensional  Change 

5%  shrinkage,  2%  gain 

Colorfastness  to: 

Water 

class  4 

Crocking 

Dry 

class  4 

Wet 

class  3 

Light-40  AATCCF  Fading  Units 

class  4 

Flammability 

Pass 

Source:  1978  Annual  Book  of  ASTH  Standards.  American  Society  for  Testing 
and  Materials,  Philadelphia,  Part  32,  page  717. 
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